Smart Paper Notes

EBHI-Seg: A Novel Enteroscope Biopsy Histopathological Haematoxylin and Eosin Image Dataset for Image Segmentation Tasks

Liyu Shi , Xiaoyan Li , Weiming Hu , Haoyuan Chen , Jing Chen , Zizhen Fan , Minghe Gao , Yujie Jing , Guotao Lu , Deguo Ma

Category: Computer Vision

2022-12-01

Background and Purpose: Colorectal cancer is a common fatal malignancy, the fourth most common cancer in men and the third most common cancer in women worldwide. Timely detection of cancer in its early stages is essential for treating the disease. Currently, there is a lack of datasets for histopathological image segmentation of rectal cancer, which often hampers assessment accuracy when computer technology is used to aid diagnosis. Methods: This study provides a new publicly available Enteroscope Biopsy Histopathological Hematoxylin and Eosin Image Dataset for Image Segmentation Tasks (EBHI-Seg). To demonstrate the validity and extensiveness of EBHI-Seg, segmentation results on the dataset are evaluated using classical machine learning methods and deep learning methods. Results: The experiments showed that deep learning methods achieved better image segmentation performance on EBHI-Seg. The maximum Dice score for the classical machine learning methods is 0.948, while the Dice score for the deep learning methods is 0.965. Conclusion: This publicly available dataset contains 5,170 images of six types of tumor differentiation stages and the corresponding ground truth images. The dataset can support researchers in developing new segmentation algorithms for medical diagnosis of colorectal cancer, which can be used in the clinical setting to help doctors and patients.
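
The Dice metric quoted in this abstract compares a predicted segmentation mask with its ground truth image. The following is a minimal sketch of that computation on binary masks stored as nested lists; the function name and the toy masks are illustrative assumptions and are not taken from the EBHI-Seg evaluation code.

```python
def dice_coefficient(pred, truth):
    """Dice = 2 * |A & B| / (|A| + |B|) for two binary masks of equal shape."""
    if len(pred) != len(truth) or any(
        len(p_row) != len(t_row) for p_row, t_row in zip(pred, truth)
    ):
        raise ValueError("masks must have the same shape")

    intersection = 0  # pixels that are foreground in both masks
    pred_sum = 0      # foreground pixels in the prediction
    truth_sum = 0     # foreground pixels in the ground truth
    for p_row, t_row in zip(pred, truth):
        for p, t in zip(p_row, t_row):
            intersection += 1 if (p and t) else 0
            pred_sum += 1 if p else 0
            truth_sum += 1 if t else 0

    total = pred_sum + truth_sum
    if total == 0:
        # Assumed convention: two empty masks count as a perfect match.
        return 1.0
    return 2.0 * intersection / total


# Hypothetical 2 x 3 masks, for illustration only.
predicted = [[1, 1, 0],
             [0, 1, 0]]
ground_truth = [[1, 0, 0],
                [0, 1, 1]]
score = dice_coefficient(predicted, ground_truth)
```

In this toy case both masks have three foreground pixels and share two of them, so the score is 2 * 2 / (3 + 3).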

translated by Google Translate

Segmentation of Weakly Visible Environmental Microorganism Images Using Pair-wise Deep Learning Features

Frank Kulwa , Chen Li , Marcin Grzegorzek , Md Mamunur Rahaman , Kimiaki Shirahama , Sergey Kosov

Category: Computer Vision

2022-08-31

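The use of environmental microorganisms (EMs) offers a highly efficient, low-cost, and harmless remedy for environmental pollution by monitoring and decomposing pollutants. This relies on how correctly the EMs are segmented and identified. To enhance the segmentation of weakly visible EM images, which are transparent, noisy, and low in contrast, this study proposes a Pairwise Deep Learning Feature Network (PDLF-Net). The use of pairwise deep learning features (PDLFs) lets the network focus more on the foreground (the EMs) by concatenating the pairwise deep learning features of each image to different blocks of the base model, SegNet. Using Shi and Tomasi descriptors, we extract deep features of each image with the VGG-16 model on patches centered at each descriptor. Then, to learn the intermediate characteristics between descriptors, the features are paired based on Delaunay triangulation to form pairwise deep learning features. In the experiments, PDLF-Net achieves outstanding segmentation results of 89.24%, 63.20%, 77.27%, 35.15%, 89.72%, 91.44%, and 89.30% on accuracy, IoU, Dice, VOE, sensitivity, precision, and specificity, respectively.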

IL-MCAM: An interactive learning and multi-channel attention mechanism-based weakly supervised colorectal histopathology image classification approach

Haoyuan Chen , Chen Li , Xiaoyan Li , Md Mamunur Rahaman , Weiming Hu , Yixin Li , Wanli Liu , Changhao Sun , Hongzan Sun , Xinyu Huang

Category: Computer Vision

2022-06-07

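In recent years, colorectal cancer has become one of the most significant diseases endangering human health. Deep learning methods are increasingly important for the classification of colorectal histopathology images. However, existing approaches focus more on end-to-end automatic classification by computers than on human-computer interaction. In this paper, we propose the IL-MCAM framework, which is based on attention mechanisms and interactive learning. The proposed framework includes two stages: automatic learning (AL) and interactivity learning (IL). In the AL stage, a multi-channel attention mechanism model containing three different attention mechanism channels and convolutional neural networks is used to extract multi-channel features for classification. In the IL stage, the framework continuously adds misclassified images to the training set in an interactive manner, which improves the classification ability of the MCAM model. We carried out a comparison experiment on our dataset and an extended experiment on the HE-NCT-CRC-100K dataset to verify the performance of the proposed IL-MCAM framework, achieving classification accuracies of 98.98% and 99.77%, respectively. In addition, we conducted an ablation experiment and an interchangeability experiment to verify the ability and interchangeability of the three channels. The experimental results show that the proposed IL-MCAM framework performs excellently in colorectal histopathology image classification tasks.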

An application of Pixel Interval Down-sampling (PID) for dense tiny microorganism counting on environmental microorganism images

Jiawei Zhang , Ning Xu , Chen Li , Md Mamunur Rahaman , Yu-Dong Yao , Yu-Hao Lin , Jinghua Zhang , Tao Jiang , Wenjun Qin , Marcin Grzegorzek

Category: Computer Vision | Artificial Intelligence

2022-04-04

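This paper proposes a novel pixel interval down-sampling network (PID-Net) for counting dense tiny objects with higher accuracy. PID-Net is an end-to-end convolutional neural network (CNN) model with an encoder-decoder architecture. The pixel interval down-sampling operations are concatenated with max-pooling operations to combine sparse and dense features. This addresses the limitation of contour conglutination of dense objects during counting. The evaluation uses classical segmentation metrics (Dice, Jaccard, and Hausdorff distance) as well as counting metrics. The experimental results show that the proposed PID-Net has the best performance and potential for dense tiny object counting tasks, achieving 96.97% counting accuracy on a dataset of 2448 yeast cell images. Compared with state-of-the-art approaches such as Attention U-Net, Swin U-Net, and Trans U-Net, the proposed PID-Net segments dense tiny objects with clearer boundaries and less incorrect debris, which shows the great potential of PID-Net for accurate counting.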
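
As a rough illustration of combining pixel interval down-sampling with max-pooling, the sketch below assumes that the down-sampling step takes every second pixel at each of the four (row, column) offsets, producing four half-resolution maps that are then stacked with a 2 x 2 max-pooled map. That reading, the function names, and the toy feature map are assumptions for illustration; the abstract does not specify the exact operation used in PID-Net.

```python
def _check_even(image):
    """Return (height, width), requiring both to be even."""
    h, w = len(image), len(image[0])
    if h % 2 or w % 2:
        raise ValueError("height and width must both be even")
    return h, w


def pixel_interval_downsample(image):
    """Split an H x W map into four H/2 x W/2 maps.

    Each output keeps every second pixel, starting at one of the four
    (row, col) offsets (0,0), (0,1), (1,0), (1,1), so no pixel is discarded.
    """
    h, w = _check_even(image)
    return [
        [[image[r][c] for c in range(dc, w, 2)] for r in range(dr, h, 2)]
        for dr in (0, 1)
        for dc in (0, 1)
    ]


def max_pool_2x2(image):
    """Standard 2 x 2 max-pooling with stride 2."""
    h, w = _check_even(image)
    return [
        [
            max(image[r][c], image[r][c + 1],
                image[r + 1][c], image[r + 1][c + 1])
            for c in range(0, w, 2)
        ]
        for r in range(0, h, 2)
    ]


# Hypothetical single-channel 4 x 4 feature map.
feature_map = [
    [1, 2, 3, 4],
    [5, 6, 7, 8],
    [9, 10, 11, 12],
    [13, 14, 15, 16],
]

# Stack the four interval-sampled maps with the max-pooled map as channels.
channels = pixel_interval_downsample(feature_map) + [max_pool_2x2(feature_map)]
```

Under this assumption the interval-sampled channels preserve every input value at half resolution, while the pooled channel keeps only the strongest response in each 2 x 2 window.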

EMDS-6: Environmental Microorganism Image Dataset Sixth Version for Image Denoising, Segmentation, Feature Extraction, Classification and Detection Methods Evaluation

Peng Zhao , Chen Li , Md Mamunur Rahaman , Hao Xu , Pingli Ma , Hechen Yang , Hongzan Sun , Tao Jiang , Ning Xu , Marcin Grzegorzek

Category: Computer Vision

2021-12-14

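Environmental microorganisms (EMs) are ubiquitous around us and have an important impact on the survival and development of human society. However, the high standards and strict requirements for environmental microorganism (EM) data have left existing databases insufficient, not to mention databases with ground truth (GT) images. This problem seriously affects the progress of related experiments. Therefore, this study develops the Environmental Microorganism Dataset Sixth Version (EMDS-6), which contains 21 types of EMs. Each type of EM contains 40 original and 40 GT images, for a total of 1680 EM images. To test the effectiveness of EMDS-6, we apply classical image processing algorithms, such as image denoising, image segmentation, and object detection. The experimental results show that EMDS-6 can be used to evaluate the performance of image denoising, image segmentation, image feature extraction, image classification, and object detection methods.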

GasHisSDB: A New Gastric Histopathology Image Dataset for Computer Aided Diagnosis of Gastric Cancer

Weiming Hu , Chen Li , Xiaoyan Li , Md Mamunur Rahaman , Jiquan Ma , Yong Zhang , Haoyuan Chen , Wanli Liu , Changhao Sun , Yudong Yao

Category: Computer Vision

2021-06-04

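Background and Objective: Gastric cancer has become the fifth most common cancer globally, and early detection of gastric cancer is essential to save lives. Histopathological examination is the gold standard for diagnosing gastric cancer. However, computer-aided diagnostic techniques are difficult to evaluate because publicly available gastric histopathology image datasets are scarce. Methods: In this paper, a new publicly available Gastric Histopathology Sub-size Image Database (GasHisSDB) is published to assess the performance of classifiers. Specifically, it includes two types of data, normal and abnormal, with a total of 245,196 tissue case images. To show that image classification methods from different periods perform differently on GasHisSDB, we select a variety of classifiers for evaluation. Seven classical machine learning classifiers, three convolutional neural network classifiers, and a novel transformer-based classifier are tested on image classification tasks. Results: This study performed extensive experiments using traditional machine learning and deep learning methods to show that methods from different periods perform differently on GasHisSDB. Traditional machine learning achieved a best accuracy of 86.08% and a lowest accuracy of only 41.12%. Deep learning achieved a best accuracy of 96.47% and a lowest accuracy of 86.21%. Accuracy varies significantly across classifiers. Conclusions: To the best of our knowledge, this is the first publicly available gastric cancer histopathology dataset containing a large number of images for weakly supervised learning. We believe that GasHisSDB can attract researchers to explore new algorithms for the automated diagnosis of gastric cancer, which can help physicians and patients in the clinical setting.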

Is the aspect ratio of cells important in deep learning? A robust comparison of deep learning methods for multi-scale cytopathology cell image classification: from convolutional neural networks to visual transformers

Wanli Liu , Chen Li , Md Mamunur Rahaman , Tao Jiang , Hongzan Sun , Xiangchen Wu , Weiming Hu , Haoyuan Chen , Changhao Sun , Yudong Yao

Category: Computer Vision

2021-05-16

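Cervical cancer is a very common and fatal type of cancer in women. Cytopathology images are often used to screen for this cancer. Because many errors can occur during manual screening, computer-aided diagnosis systems based on deep learning have been developed. Deep learning methods require input images of a fixed size, but the sizes of clinical medical images are inconsistent, and resizing them directly alters their aspect ratios. Clinically, the aspect ratios of cells in cytopathology images provide important information for doctors diagnosing cancer, so direct resizing appears problematic. However, many existing studies have resized the images directly and still obtained highly robust classification results. To find a reasonable explanation, we conducted a series of comparative experiments. First, the raw data of the SIPaKMeD dataset are pre-processed to obtain standard and scaled datasets. Then, the datasets are resized to 224 x 224 pixels. Finally, 22 deep learning models are used to classify the standard and scaled datasets. The results indicate that deep learning models are robust to changes in the aspect ratio of cells in cervical cytopathology images. This conclusion is also validated on the Herlev dataset.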
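
To make the aspect ratio issue concrete, the sketch below contrasts two generic ways of fitting an image into a 224 x 224 input: direct resizing, which changes the width-to-height ratio, and aspect-preserving scaling followed by padding. The function names and the example image size are assumptions for illustration; the abstract does not describe how the standard and scaled datasets were actually constructed.

```python
def direct_resize_distortion(width, height, target=224):
    """Horizontal scale divided by vertical scale when forcing target x target.

    A value of 1.0 means the aspect ratio is unchanged; any other value
    means cells in the image are stretched or squeezed.
    """
    scale_x = target / width
    scale_y = target / height
    return scale_x / scale_y


def aspect_preserving_fit(width, height, target=224):
    """Scale the longer side to `target`, then report padding for the rest.

    Returns (new_width, new_height, pad_width, pad_height).
    """
    scale = target / max(width, height)
    new_w = max(1, round(width * scale))
    new_h = max(1, round(height * scale))
    return new_w, new_h, target - new_w, target - new_h


# Hypothetical cell image that is twice as wide as it is tall.
distortion = direct_resize_distortion(448, 224)
fit = aspect_preserving_fit(448, 224)
```

For the 448 x 224 example, direct resizing halves the width relative to the height, whereas the aspect-preserving fit yields a 224 x 112 image that needs 112 rows of padding.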

GasHis-Transformer: A Multi-scale Visual Transformer Approach for Gastric Histopathology Image Classification

Haoyuan Chen , Chen Li , Xiaoyan Li , Ge Wang , Weiming Hu , Yixin Li , Wanli Liu , Changhao Sun , Yudong Yao , Yueyang Teng

Category: Computer Vision

2021-04-29

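Existing deep learning methods for gastric cancer diagnosis commonly use convolutional neural networks. Recently, the visual transformer has attracted great attention because of its performance and efficiency, but its applications are mostly in the field of computer vision. This paper proposes a multi-scale visual transformer model, referred to as GasHis-Transformer, for gastric histopathology image classification (GHIC), which automatically classifies microscopic gastric images into abnormal and normal cases. The GasHis-Transformer model consists of two key modules, a global information module and a local information module, which extract histopathological features effectively. In our experiments, a public hematoxylin and eosin (H&E) stained gastric histopathology dataset with 280 abnormal and normal images is divided into training, validation, and test sets in a ratio of 1:1:2. On the test set of this dataset, the model achieves precision, recall, F1-score, and accuracy of 98.0%, 100.0%, 96.0%, and 98.0%, respectively. Furthermore, a critical study is conducted to evaluate the robustness of GasHis-Transformer, in which ten different kinds of noise are added, including four adversarial attacks and six conventional image noises. In addition, a clinically meaningful study is carried out to test the gastrointestinal cancer identification performance of GasHis-Transformer on 620 abnormal images, achieving 96.8% accuracy. Finally, a comparative study is performed to test generalizability with both H&E and immunohistochemical stained images on a lymphoma image dataset and a breast cancer dataset, producing comparable F1-scores (85.6% and 82.8%) and accuracies (83.9% and 89.4%), respectively. In conclusion, GasHis-Transformer demonstrates high classification performance and shows significant potential in the GHIC task.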

Defense Against Adversarial Attacks on Audio DeepFake Detection

Piotr Kawa , Marcin Plata , Piotr Syga

Category: Machine Learning

2022-12-30

Audio DeepFakes are artificially generated utterances created using deep learning methods with the main aim of fooling listeners; most such audio is highly convincing. Their quality is sufficient to pose a serious threat to security and privacy, for example to the reliability of news or through defamation. To counter these threats, multiple neural network-based methods for detecting generated speech have been proposed. In this work, we cover the topic of adversarial attacks, which decrease the performance of detectors by adding superficial changes (difficult for a human to spot) to the input data. Our contribution consists of evaluating the robustness of 3 detection architectures against adversarial attacks in two scenarios (white-box and using a transferability mechanism) and then enhancing it through adversarial training performed with our novel adaptive training method.
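
The abstract does not name the specific attacks it evaluates. Purely as an illustration of how a small input change can degrade a detector in the white-box setting, the sketch below applies the well-known fast gradient sign method (FGSM) to a toy logistic-regression "detector". The model, its weights, the feature vector, and the step size are all hypothetical and stand in for a real audio DeepFake detector.

```python
import math


def sigmoid(z):
    return 1.0 / (1.0 + math.exp(-z))


def predict(weights, bias, x):
    """Probability that input x belongs to class 1 under a logistic model."""
    return sigmoid(sum(w * xi for w, xi in zip(weights, x)) + bias)


def cross_entropy(p, y):
    """Binary cross-entropy loss for predicted probability p and label y."""
    return -(y * math.log(p) + (1 - y) * math.log(1 - p))


def fgsm(weights, bias, x, y, epsilon):
    """One FGSM step: move each feature by epsilon in the direction that
    increases the loss. For a logistic model, dLoss/dx = (p - y) * w."""
    p = predict(weights, bias, x)
    grad = [(p - y) * w for w in weights]
    sign = [(g > 0) - (g < 0) for g in grad]
    return [xi + epsilon * s for xi, s in zip(x, sign)]


# Hypothetical detector parameters and input features.
weights = [2.0, -1.0, 0.5]
bias = 0.1
x = [0.4, 0.2, 0.9]
y = 1            # true label
epsilon = 0.05   # maximum change allowed per feature

x_adv = fgsm(weights, bias, x, y, epsilon)
loss_clean = cross_entropy(predict(weights, bias, x), y)
loss_adv = cross_entropy(predict(weights, bias, x_adv), y)
```

Each feature moves by at most `epsilon`, yet the loss on the perturbed input is higher than on the clean one. Adversarial training, in general terms, adds such perturbed inputs to the training data.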

Fast-moving object counting with an event camera

Kamil Bialik , Marcin Kowalczyk , Krzysztof Blachut , Tomasz Kryjak

Category: Computer Vision

2022-12-16

This paper proposes the use of an event camera as a component of a vision system that enables counting of fast-moving objects, in this case falling corn grains. This type of camera transmits information about changes in the brightness of individual pixels and is characterised by low latency, no motion blur, correct operation in different lighting conditions, and very low power consumption. The proposed counting algorithm processes events in real time. The operation of the solution was demonstrated on a stand consisting of a chute with a vibrating feeder, which allowed the number of falling grains to be adjusted. The objective of the control system with a PID controller was to maintain a constant average number of falling objects. The proposed solution was subjected to a series of tests to verify that the developed method operates correctly. On the basis of these tests, the validity of using an event camera to count small, fast-moving objects, and the associated wide range of potential industrial applications, can be confirmed.
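
The control loop mentioned in this abstract can be pictured with a textbook discrete PID controller. The sketch below is a generic implementation under assumed gains and an assumed setpoint of 100 grains per interval; the class name, the parameter values, and the idea that the output drives the feeder directly are illustrative assumptions, not details from the paper.

```python
class PIDController:
    """Textbook discrete PID: u = Kp*e + Ki*sum(e*dt) + Kd*de/dt."""

    def __init__(self, kp, ki, kd, setpoint):
        self.kp = kp
        self.ki = ki
        self.kd = kd
        self.setpoint = setpoint
        self.integral = 0.0
        self.previous_error = None  # no derivative on the first sample

    def update(self, measurement, dt):
        """Return the control output for one new measurement."""
        error = self.setpoint - measurement
        self.integral += error * dt
        if self.previous_error is None:
            derivative = 0.0
        else:
            derivative = (error - self.previous_error) / dt
        self.previous_error = error
        return self.kp * error + self.ki * self.integral + self.kd * derivative


# Hypothetical gains and target count per interval.
controller = PIDController(kp=0.5, ki=0.1, kd=0.2, setpoint=100.0)

# Two consecutive measured counts, one time unit apart.
first_output = controller.update(80.0, dt=1.0)
second_output = controller.update(90.0, dt=1.0)
```

With these assumed values the first output is 0.5 * 20 + 0.1 * 20 = 12, and the second is 0.5 * 10 + 0.1 * 30 + 0.2 * (-10) = 6, so the correction shrinks as the measured count approaches the setpoint.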
